43 research outputs found

    Multiplexed profiling of RNA and protein expression signatures in individual cells using flow or mass cytometry

    Get PDF
    Advances in single-cell analysis technologies are providing novel insights into phenotypic and functional heterogeneity within seemingly identical cell populations. RNA within single cells can be analyzed using unbiased sequencing protocols or through more targeted approaches using in situ hybridization (ISH). The proximity ligation assay for RNA (PLAYR) approach is a sensitive and high-throughput technique that relies on in situ and proximal ligation to measure at least 27 specific RNAs by flow or mass cytometry. We provide detailed instructions for combining this technique with antibody-based detection of surface/internal protein, allowing simultaneous highly multiplexed profiling of RNA and protein expression at single-cell resolution. PLAYR overcomes limitations on multiplexing seen in previous branching DNA–based RNA detection techniques by integration of a transcript-specific oligonucleotide sequence within a rolling-circle amplification (RCA). This unique transcript-associated sequence can then be detected by heavy metal (for mass cytometry)- or fluorophore (for flow cytometry)-conjugated complementary detection oligonucleotides. Included in this protocol is methodology to label oligonucleotides with lanthanide metals for use in mass cytometry. When analyzed by mass cytometry, up to 40 variables (with scope for future expansion) can be measured simultaneously. We used the described protocol to demonstrate intraclonal heterogeneity within primary cells from chronic lymphocytic leukemia patients, but it can be adapted to other primary cells or cell lines in suspension. This robust, reliable and reproducible protocol can be completed in 2–3 d and can be paused at several stages for convenience

    Structural motifs recurring in different folds recognize the same ligand fragments

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The structural analysis of protein ligand binding sites can provide information relevant for assigning functions to unknown proteins, to guide the drug discovery process and to infer relations among distant protein folds. Previous approaches to the comparative analysis of binding pockets have usually been focused either on the ligand or the protein component. Even though several useful observations have been made with these approaches they both have limitations. In the former case the analysis is restricted to binding pockets interacting with similar ligands, while in the latter it is difficult to systematically check whether the observed structural similarities have a functional significance.</p> <p>Results</p> <p>Here we propose a novel methodology that takes into account the structure of both the binding pocket and the ligand. We first look for local similarities in a set of binding pockets and then check whether the bound ligands, even if completely different, share a common fragment that can account for the presence of the structural motif. Thanks to this method we can identify structural motifs whose functional significance is explained by the presence of shared features in the interacting ligands.</p> <p>Conclusion</p> <p>The application of this method to a large dataset of binding pockets allows the identification of recurring protein motifs that bind specific ligand fragments, even in the context of molecules with a different overall structure. In addition some of these motifs are present in a high number of evolutionarily unrelated proteins.</p

    FunClust: a web server for the identification of structural motifs in a set of non-homologous protein structures

    Get PDF
    The occurrence of very similar structural motifs brought about by different parts of non homologous proteins is often indicative of a common function. Indeed, relatively small local structures can mediate binding to a common partner, be it a protein, a nucleic acid, a cofactor or a substrate. While it is relatively easy to identify short amino acid or nucleotide sequence motifs in a given set of proteins or genes, and many methods do exist for this purpose, much more challenging is the identification of common local substructures, especially if they are formed by non consecutive residues in the sequence

    Discriminative structural approaches for enzyme active-site prediction

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Predicting enzyme active-sites in proteins is an important issue not only for protein sciences but also for a variety of practical applications such as drug design. Because enzyme reaction mechanisms are based on the local structures of enzyme active-sites, various template-based methods that compare local structures in proteins have been developed to date. In comparing such local sites, a simple measurement, RMSD, has been used so far.</p> <p>Results</p> <p>This paper introduces new machine learning algorithms that refine the similarity/deviation for comparison of local structures. The similarity/deviation is applied to two types of applications, single template analysis and multiple template analysis. In the single template analysis, a single template is used as a query to search proteins for active sites, whereas a protein structure is examined as a query to discover the possible active-sites using a set of templates in the multiple template analysis.</p> <p>Conclusions</p> <p>This paper experimentally illustrates that the machine learning algorithms effectively improve the similarity/deviation measurements for both the analyses.</p

    CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in Protein Data Bank (PDB), thus, the functional prediction of these proteins has become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but these methods need to be improved further, such as, enhancing the accuracy, sensitivity, and the computational speed. Here, an accurate algorithm, the CMASA (Contact MAtrix based local Structural Alignment algorithm), has been developed to predict unknown functions of proteins based on the local protein structural similarity. This algorithm has been evaluated by building a test set including 164 enzyme families, and also been compared to other methods.</p> <p>Results</p> <p>The evaluation of CMASA shows that the CMASA is highly accurate (0.96), sensitive (0.86), and fast enough to be used in the large-scale functional annotation. Comparing to both sequence-based and global structure-based methods, not only the CMASA can find remote homologous proteins, but also can find the active site convergence. Comparing to other local structure comparison-based methods, the CMASA can obtain the better performance than both FFF (a method using geometry to predict protein function) and SPASM (a local structure alignment method); and the CMASA is more sensitive than PINTS and is more accurate than JESS (both are local structure alignment methods). The CMASA was applied to annotate the enzyme catalytic sites of the non-redundant PDB, and at least 166 putative catalytic sites have been suggested, these sites can not be observed by the Catalytic Site Atlas (CSA).</p> <p>Conclusions</p> <p>The CMASA is an accurate algorithm for detecting local protein structural similarity, and it holds several advantages in predicting enzyme active sites. The CMASA can be used in large-scale enzyme active site annotation. The CMASA can be available by the mail-based server (<url>http://159.226.149.45/other1/CMASA/CMASA.htm</url>).</p

    Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies

    Get PDF
    In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life

    The LabelHash algorithm for substructure matching

    Get PDF
    Background: There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. However, experimental protein function determination is expensive and very time-consuming. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Results: We present LabelHash, a novel algorithm for matching substructural motifs to large collections of protein structures. The algorithm consists of two phases. In the first phase the proteins are preprocessed in a fashion that allows for instant lookup of partial matches to any motif. In the second phase, partial matches for a given motif are expanded to complete matches. The general applicability of the algorithm is demonstrated with three different case studies. First, we show that we can accurately identify members of the enolase superfamily with a single motif. Next, we demonstrate how LabelHash can complement SOIPPA, an algorithm for motif identification and pairwise substructure alignment. Finally, a large collection of Catalytic Site Atlas motifs is used to benchmark the performance of the algorithm. LabelHash runs very efficiently in parallel; matching a motif against all proteins in the 95 % sequence identity filtered non-redundant Protein Data Bank typically takes no more than a few minutes. The LabelHash algorithm is available through a web server and as a suite of standalone programs a

    Borrelia burgdorferi Requires the Alternative Sigma Factor RpoS for Dissemination within the Vector during Tick-to-Mammal Transmission

    Get PDF
    While the roles of rpoSBb and RpoS-dependent genes have been studied extensively within the mammal, the contribution of the RpoS regulon to the tick-phase of the Borrelia burgdorferi enzootic cycle has not been examined. Herein, we demonstrate that RpoS-dependent gene expression is prerequisite for the transmission of spirochetes by feeding nymphs. RpoS-deficient organisms are confined to the midgut lumen where they transform into an unusual morphotype (round bodies) during the later stages of the blood meal. We show that round body formation is rapidly reversible, and in vitro appears to be attributable, in part, to reduced levels of Coenzyme A disulfide reductase, which among other functions, provides NAD+ for glycolysis. Our data suggest that spirochetes default to an RpoS-independent program for round body formation upon sensing that the energetics for transmission are unfavorable

    One origin for metallo-Ξ²-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees

    Get PDF
    This work was supported by BBSRC (grant BB/F016778/1)Bacteria use metallo-Ξ²-lactamase enzymes to hydrolyse lactam rings found in many antibiotics, rendering them ineffective. Metallo-Ξ²-lactamase activity is thought to be polyphyletic, having arisen on more than one occasion within a single functionally diverse homologous superfamily. Since discovery of multiple origins of enzymatic activity conferring antibiotic resistance has broad implications for the continued clinical use of antibiotics, we test the hypothesis of polyphyly further; if lactamase function has arisen twice independently, the most recent common ancestor (MRCA) is not expected to possess lactam-hydrolysing activity. Two major problems present themselves. Firstly, even with a perfectly known phylogeny, ancestral sequence reconstruction is error prone. Secondly, the phylogeny is not known, and in fact reconstructing a single, unambiguous phylogeny for the superfamily has proven impossible. To obtain a more statistical view of the strength of evidence for or against MRCA lactamase function, we reconstructed a sample of 98 MRCAs of the metallo-Ξ²-lactamases, each based on a different tree in a bootstrap sample of reconstructed phylogenies. InterPro sequence signatures and homology modelling were then used to assess our sample of MRCAs for lactamase functionality. Only 5 % of these models conform to our criteria for metallo-Ξ²-lactamase functionality, suggesting that the ancestor was unlikely to have been a metallo-Ξ²-lactamase. On the other hand, given that ancestral proteins may have had metallo-Ξ²-lactamase functionality with variation in sequence and structural properties compared with extant enzymes, our criteria are conservative, estimating a lower bound of evidence for metallo-Ξ²-lactamase functionality but not an upper bound.Publisher PDFPeer reviewe
    corecore